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Virtually all parametric statistical procedures have 
been shown to be special cases of canonical correlation analy^sis, 
which is a useful research methodology particularly when augmented by 
the calculation of canonical structure, index, and invar iance 
coefficients. A logic for conducting stepwise canonical correlation 
analysis based upon evaluation of canonical com^unality coefficients 
is presented. The coefficients indicate how much of a variable's 
variance is reproducible from the canonical solution. Variables with 
the sma'llest communality coefficients may be deleted in a stepwise 
pro^dure as a direct analogue to stepwise backward multiple 
r^fression analysis. A heuristic demonstration of the technique 
involving two criterion variables and five predictor variables in a 7 
by. 7 correlation matrix is , presented. The application provides more 
insight into the dynamics of social science phenomena, lessens the 
probability- of Type II errors, and provides estimates of the 
general izability of results. (Author/CM) 
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Abstract 

Virtually all parametric statistical procedures have been 
shown to be special cases of canonical correlation analysis/ 
and canonical correlation analysis has been shown to be a 
useful research methodology, particulary when augmented by the 
calculation of canonical structure, index, and' inv^riancfe 
coefficients* This paper presents a logic for conducting 
stepwise canonical correlation analy/sis, based, upon evaluation 
of canonical communality coefficients. A heuristic 
demonstration of the technique is included. 



Analysis of variance (ANOVA) techniques and their 
analogues (ANCOVA, MANOVA, and MANCOVA) have beeYi among the 
most widely used researqh methods employed in the social 
sciences (Edgington, 1974? Willson, 1980). However, these 
techniques may hot be fully appropriate for use in research 
involving multiple independent variables, if some of the 
independent variables are higher thap interval ly scaled. This 
frequently occurs because, as Kerlinger (1979, p. 119) notes, 
most non-manipulated variables, i.e., what Croribach (1957) has 



termed aptitude variables^ tend to be higher than nominally 
scaled. The use of ANOVA techniques in thijs situation can be 
unfortunate, becaus'e ""when we reduce interval level of scale 
data to the nominal level of scale we are doing nothing less 
than thoughtlessly throwing away information which v^e 
previously went to some trouble to collect. If research is 
conducted for the purpose of acquiring knowledge, then is it 
consistent with our purpose to employ a method which throws, 
away' information which might provide a more refined 
understanding of the phenomena which we are studying?" 
(Thompson, 1981, p. 8; "see also Cohen, 1968). 

This logic suggests that researchers should consider more 
frequent use of more general analytic techniques when one or 
more independent variables are higher than nominally scaled. 
When this is the case, and when the researcher is also 
ihvestigati^ng multiple dependent , variables, canonical 
correlation analysis is an appropriate analytic technique 
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(Thompson, 1980aT.' The procedure has Id^eii usefully applied in 
previous studies, (e.g. , Thompson, 19B0b; Thompson & Miller, 
1981; Thompson & Pitts, 1981), and the technique is 
particularly helpful when the analysis is augmented^by the 
calculation of indices such as canonical structure and index 
coefficients (Thompson & PranT^iewitez, .1979) and canonical 
invariance coefficients (Thompson, 1982). Inde^d, Knapp 
(1978, p. 410) ^has noted that "virtually all of the commonly 
encountered parametric tests of significance can be treated as 
special cases of canonical correlation analysis, which is the 
general procedure for investigating the relationships between 
two sets of variables." 

This paper presents a methbd for implementing a new 
extension of the technique, stepwise canonical correlation 
analysis, which may make canonical correlation analysis an 
even more useful! procedure. The procedure is a direct 
analogue of multiple*^ regression analysis. A computer program 
which implements this new canonical technique is available 
from the author? the program was used to generate the results 
presented in this report's heuristic example. 

A Stepwise Canonical Logic 
A canonical structure coefficiervt (Cooley & Lohnes, 1971) 
represents the , correlation between a variable and a canoni'cal 
function. The square of a canonical structure- coefficient 
indicates the proportion of variance which a variable linqarly 
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shares with a canonical function. A variable's canonical 

communality coefficient (Thompson, 1980a, p. 19) is equal to 

the sum of all the variable's squareS canonical structure 

coefficients; the number of .structure coefficients which a 

variable has is equal to the number of variables in the 

smaller of the two variable sets. 

In effect, canonical communality coefficients indicate 
how much of a variable's variance is reproducable from the 
canonical solution. Variables with small „ canonical 
communality coefficients, i.e., coefficients close to zejro, 
contribute little to a canonical, solution. Thus, , variables 
with the smallest conimunality coefficients .may be deleted in a 
stepwise procedure as a direct analogue to stepwise backward 
multiple regression analysis.* 

Stepwise canonical correlation analysis will produce more 
parsimonious results and will conserve degrees of freedom, for 
hypothesis testing. For example, the degrees of freedom for 
testing the canonical correlation associated with the first 
canonical function is equal to the number of variables in each 
variable set times each other. If both variable sets consist 
of five variables, the degrees of freedom for testing the 
statistical significance of the first canonical correlation 
would be 25 (five times five). After stepwise deletion of one 
-variable, the degrees of freedom for the first function would 
then be 20 (four times five). Thu6 the conservation of 
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degrees of freedom can be sizeable, and tends to reduce the 
likelihood of Type II errors occurring as a function of 
variable set sizes. 

0 • 

/ \ 

Heuristic Example 
Pitts, as part .of her dissertation research, performed a 
stepwise canonical correlation analysis employing this 
technique. The theoretical framework and substantive 
implications of her work will not be discussed here (see Pitts 
& Thompson, 1982), but heir statistical analysis will be 
summarized in order to present a concrete example of 
iinplementation of the technique. Her analysis involved two 
criterion variables and five'v predictor variables. The 
seven-by-seven correlation matrix upon which the analysis was 
based is presented in Table 1. . 

INSERT TABLE 1 ABOUT HERE. 

The two canonical functions extracted from the matrix are 
presented in Table 2. „ The squarfed canonical correlation, 
i.e., the eigenvalue, associated with the first function was 
.48 (x^ =84.8, df = 10, ^ ^ .05); the squared canonical 
correlation associated with the second function was .04 
'(X^o = 5.2, df = 4, £ > .05) . 

INSER^ TABLE 2 ABOUT HERE. 



. . PageVS ' 

Although in this case the results would -not have been, 
altered, Pitts« (Note 1) has argued that generally only 
predictor variables should be considered for stepwise 
deletion. This' position is reasonable since researchers are 
generally most interested in understanding criterion father 
than predictqr variables; this" approach is also more 
comparable to ^stepwise regression ^ analysis, since the 
eriterion variable is never "removed" in a stepwise regression 
analysis. However, in some cases the two variable sets, can 
not readiiy be characterized as "criterion" or "preidictor,," 
and in these cases all variables should be given equal 
consideration for deletion. Thus,'- since in this study the 
attentional style variable had the smalles.t canonical 
communality coefficient <.16), the variable was deleted at the 
first step. 

Two canonical functions were extracted from the 
six-by-six correlation matrix produced from the Table 1 matrix 
by ignoring corr^ations involving the deleted variable. 
These results are presented in Table 3. The squared canonical 
correlation associated with the first function was ' .42 
(X^ = 72.6, df * 8, £ < .05); the Sjquared ' canonical 
correlation associated with the second function was .04 
(x^ = 4.6, df = 3, £»> .05). Since the reflective-impulsive 
variable had the smallest canonical communality coefficient at 
this step (.19), ^he variable was deleted from the analysis at 
the end of the second step". 
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INSERT TABLE 3 ABOUT HERE> 
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Two canonical functions were then extracted from the 
five-by-five correlation matrix produced from the Table 1 
maferix *y ignoring the correlations involving the two deleted 
variables. These tesults are presented in Table 4. The 
squared canoniLcal., correlation associated with the first 
function was .^9 ( = 65.8, df * 6,. £ < .,05); the squared 
canonical fiorfrelation associated with the second function 
was .04 (x^ = 4.7, df = 2, £ > .05). Since the canonical 
coiumunality coefficients for the variables left in the 
equation . at the end of this step were relatively homogeneous. 
Stepwise deletion of variables was terminated at step three. 
Thus', the Table 4 results were the resul^ interpreted by the 
researcher. ' Of course, \ only the first function! was 
interpreted since the second function was not statistically 
significant. 

INSERT TABLE 4 ABOUT HERE> 

0 = • 

Conclusions 

Tiie logic recommended here represents an extension of 
conventional canonical correlation analysis. The extension 
makes clear that, as noted by Baggaley (1981), « parametric 
statistical techniques, including stepwise multiple 
regression, are special cases' of canonical correlatioli ' 
analysis. However, the logic should be of more than heuristic" 
value. Its application will provide more insight into the 
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dynamics of social science phenomena, lessen the probability 
of Type o II- errors, and provide estimates of _ the 
general izability of results. As Thorndike (1978, p. 188)'" 
explains, "as the numbjer of variables 'increases, the probable 
'effect of these sources of [error] variation on the canonical 
correlations increases. Therefore, the fewer variables there 
are in a canonical analysis which yields a correlation of a 
"^given magnitude, the greater is the likelihood" that phai 
correlation is due to real, populat^ion-wide sources of 
covariation, rathsr than to ^sample-specif ic sources.." Finally, 
it shpuld be rioted that a forward stepwise canonical analysis 
could also be couched on evaluation qi* canonical communality 
coefficients; at each step the variable which would then have 
the hiqhest structure coefficient would be added to the 
equation! 
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• Reference Notes 






Pitts, ^M.C. Personal communication. 


December 2, 1981. . 
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Table 1 



Correlation Matrix 
(n = 127) 



Variable 



UKBD EASY X3EET 



Ability on Hard (HARD) 

Ability on Easy ( EASY ) 

GEFT (GEPT) 

Field Independence (FI) 

Attentional Style (AS) 
Reflective Impulsive (RI) 

Grade Level (GL) 



.66** 

.15* .31** 
.40** .49** 

-.23** -.22** 
.24** .28** 

..38** .38** 



-FI 



AS 



RI .OL 



.34** 

-.16* .00 

.14 .08 

.29** .06 



-.06 
-.03 



.17* -- 



**r 



.05 
.01 



Table 2 

GancMiical Solution for First Stqp 
(n = 127) 



Vciriable 


Function I 


Punctidh II 


h2 


F 


S 


SSQ 


F 


S 


SSQ 


Ability on Hard ' 


.43 


.87 


.75 


-1.26 


-.50 


.25 


1.00 


Ability exi Easy 


.66 


.95 


.89 


1.16 


.33 


.11 


1.00 


GEPT 


-.10 


.39 


.15 


1.02 


.83 


.70 


.84 


Field Independence 


.69 


.71 


.51 


.00 


.32 


.10 


.61 


Attentional Style 
Reflective Iiqculsive 


-.34 


-.36 


.13 


.31 


.16 


.03 


.16 


.26 


.41 


..17 


.02 


.06 


.00 


.18 


Grade Level 


.53 


.60 


.36 


- -.49 


-.20 


.04 


.39 



NOTES "F" = canonical fimction coefficients; • "S" . = cancmcal structure 
coefficients; "SSQ" = squared canonical structure coefficients; "h " 
= cemcHiicsLL cannunedity corff icients. 



Tabie 3 



C^oionical Solution for Secsond Step 
(n = 127) 



Variable 



Ability on Hard 
Ability on Easy 

GEPT 

Field Indqpendenoe 
Ref tective Iinpulsi^^ 
Grade Level 



Function I 


Function II 


h2 


P 


S 


SSQ 


P 


S 


SSQ 


.39 


.85 


.72 


-1.27 


-.52 


.28 


1.00 


.70 


.96 


.92 


1.13 


.29 


.08 


1.00 


-.02 


.42 


.18 


1.03 


.83 


.69 


.86 


.71 


.76 


.58 


-.07 


. .24 


.06 


.64 


.29 


.44 


.19 


-.03 


.01 


,.00 


.19 


.54 


.63 


.40 


-.58 


-.29 


.08 


.50 



NOTES "F" = cancxiical function coefficients; "S" = carexiical structure 
coefficients; °"SSO" = squared cancxiical structure coefficients; 
"ir" = canaiical connunality coefficients. 



Table. 4 

-Pina± ^Snonical Solution 
(n = 127) 





Function I 




Function II 






Varicible 


F 


I 


S 


SSQ 


F 


I 


S 


SSQ 


h2 


Ability on Hard 
Ability on Easy 


.38 
.70 


.53 
.60 


.85 
.96 


.72 
.92 


-1.27 
1.13 


-.10 
.06 


-.53 
.29 


.28 
.08 


'l.OO 
1.00 


Field Independence 
Grade Level 


.00 
.76 
.61 


.28 
.50 
.41 


.44 
.79 
.66 


.19 
.63 
.43 


1.02 
-.08 
-.59 


.16 
.05 
-.06 


.82 
.24 
-.29 


.68 
.06 
.09 


.87 
.68 
.52 



NOTE: "F" = cancxdcal function coefficients; "I" = canonical' index 
coefficients; "S" = canonical structure ooeffjcients; "SSQ" = 
squared canonical structure coefficients; "h " = canonical 
conmunedity coefficients. 



